The Traffic Scene Understanding and Prediction Based on Image Captioning
نویسندگان
چکیده
منابع مشابه
Segmentation-Based Urban Traffic Scene Understanding
We propose a method to recognize the traffic scene in front of a moving vehicle with respect to the road topology and the existence of objects. To this end, we use a two-stage system, where the first stage abstracts from the underlying image by means of a rough super-pixel segmentation of the scene. In a second stage, this meta representation is then used to construct a feature set for a classi...
متن کاملPhrase-based Image Captioning
Generating a novel textual description of an image is an interesting problem that connects computer vision and natural language processing. In this paper, we present a simple model that is able to generate descriptive sentences given a sample image. This model has a strong focus on the syntax of the descriptions. We train a purely bilinear model that learns a metric between an image representat...
متن کاملImage Holistic Scene Understanding Based on Image Intrinsic Characteristics and Conditional Random Fields
Image holistic scene understanding based on image intrinsic characteristics and conditional random fields is proposed. The model integrates image scene classification, image semantic segmentation and object detection. 1) For the scene classification, we use method of PHOW feature extraction plus KPCA dimensional reduction to obtain feature information for each image. 2) For object detection sec...
متن کاملImage Segmentation and Scene Understanding Project
1. Introduction Scene or image understanding deals with the problem of making a computer " understand " the world behind the image. This can be done in a number of different ways. In this project, we will deal with a kind of problem of scene understanding, semantic image segmentation or pixel labeling. Multi-class image segmentation or pixel labeling does more than the task of object recognitio...
متن کاملSupport surfaces prediction for indoor scene understanding
In this paper, we present an approach to predict the extent and height of supporting surfaces such as tables, chairs, and cabinet tops from a single RGBD image. We define support surfaces to be horizontal, planar surfaces that can physically support objects and humans. Given a RGBD image, our goal is to localize the height and full extent of such surfaces in 3D space. To achieve this, we create...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Access
سال: 2021
ISSN: 2169-3536
DOI: 10.1109/access.2020.3047091